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We report the observation of the decay B~ Di*^~^ K~ £~Ve based on 342 fb~^ of data collected 
at the T{4S) resonance with the BABAR detector at the PEP-II e'^e~ storage rings at SLAG. A 
simultaneous fit to three D'^ decay chains is performed to extract the signal yield from measurements 
of the squared missing mass in the B meson decay. We observe the decay B~ — > Di*'''^ K~ £~Ve with 
a significance greater than five standard deviations (including systematic uncertainties) and measure 
its branching fraction to be B(B" Di*'+/r-£-I7«) = [6.13ll:o3(stat.)±0.43(syst.)±0.51(B(-D3))]x 
lO"'', where the last error reflects the limited knowledge of the Ds branching fractions. 

PACS numbers: 13.20.He, 12.15.Ji, 12.38.Qk 



The study of charmed inclusive semileptonic B meson 
decays enables the measurement of the CKM matrix ele- 
ment \Vcb\- This measurement relies on a precise knowl- 
edge of all semileptonic B meson decays. Decays of or- 
bitally excited D mesons, from the process B — >■ D**£v, 
constitute a significant fraction of these decays [1], and 
may help explain the discrepancy between the inclusive 
B — )• Xc^v rate, where is a charmed hadronic fi- 



nal state, and the sum of the measured exclusive decay 
rates [1, 2]. So far, analyses of these decays have fo- 
cused on the reconstruction of B — >■ D'^*^itIv states [3-5]. 
In such analyses, experimental data are interpreted as a 
sum of the four D** resonances. The results show the 
dominance of B decays to broad resonances, while QCD 
sum rules imply the opposite [6]. Conversely, a small 
contribution from broad D** states implies the presence 
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of a non-rcsonant B — > D^*^t:£i> component, which has 
not yet been observed. Measurement of the branching 
fraction for the as-yet-unobserved B~ — > Di*^~^ K~£~Vi 
decay [7] would provide additional information relevant 
to this issue, by exploring the hadronic mass distribu- 
tion above 2.46 GeV/c^ where resonant and non-resonant 

components are present. In addition, the measurement 

_ (*)+ 

of i? Ds K i will provide a better estimate 

of background in future studies of semileptonic Bg — > 

DfX£~T7^ decays. 

Using the shape of the hadronic mass spectrum in B 
semileptonic decays, a rough estimate on the branching 
fraction Di*^^K-£-Ve) is of the order of IQ-^ 

[8, 9], which is consistent with the limit set by the AR- 
GUS Collaboration, B{B- Di*^+K-i-Ve) < 5 x lO'^ 
at 90% confidence level [10]. A comparison between this 
expectation and the actual measurement can confirm or 
refute the expected rapid decrease of the hadronic mass 
distribution at high values. 

In this paper, we present the observation of i?~ — >■ 
Ds K~£~vi decays, where £ = e, ^i. This analysis does 
not differentiate between final states with Df and f*^, 
where Z)*+ decays via emission of neutral decay prod- 
ucts that are not reconstructed. The results are based 
on a data sample of Nj^g = (376.9 ± 4.1) x lO'' BB 
pairs recorded at the T(4S) resonance with the BaBAR 
detector [11] at the PEP-II asymmetric energy e+e" stor- 
age rings at the SLAG National Accelerator Laboratory. 
This corresponds to an integrated luminosity of 342 fb~"^. 
In addition, 37fb~^ of data collected about 40McV be- 
low the resonance are used for background studies. A 
GEANT-based Monte Carlo (MC) simulation [12] of BB 
and continuum events (e+e~ — > qq with q = u, d, s, c) is 
used to study the detector response and acceptance, val- 
idate the analysis technique, and evaluate signal efficien- 
cies. The sample of simulated BB events is equivalent 
to approximately 3 times the data sample. The signal 
MC events arc generated by adapting the decay model of 
Goity and Roberts [13] to describe DfK~ final states. 
Two alternative signal MC samples are used to estimate 
systematic uncertainties: a sample based on the ISGW2 
model [14], in which B~ mesons decay to D'q'£~v with 
Dq^ — > DfK^, and a sample based on a simple phase 
space model. The signal MC samples are equivalent to 
approximately 10 times the expected signal yield. 

We reconstruct candidates in three decay chains: 
Dj (j)-K+ with (j) -)■ K+K-, D+ K*^K+ with 
K-Tr+, and D+ K",K+ with ifO ^ tt+tt". 
The K*°, and K° candidates are formed by combin- 
ing oppositely charged tracks. To suppress combinato- 
rial background from the £>+ reconstruction in the first 
two decay chains, we employ a feed-forward neural net- 
work (multilayer perceptron, MLP [15]) with three in- 
put variables and four hidden layers. The input vari- 
ables are the absolute value of the difference between 



the reconstructed and the nominal mass values of the 
(p/K*^ candidate [1], the absolute value of the cosine 
of the helicity angle of the (f>/K*^, and the proba- 
bility of the fit to the Df candidate. The helicity an- 
gle is defined as the angle between the candidate 
and one kaon originating from the (p/K*^ in the (j)/K*'^ 
rest frame. To suppress combinatorial background in the 
K^K^ decay chain, we require the invariant mass 
of the charged pions forming the Kg candidate to satisfy 
0.490 GcV/c2 < mimr) < 0.506 GcV/c^, the fiight length 
of the Kg to be larger than 1 mm, the cosine of the lab- 
oratory angle between the Kg momentum and the line 
connecting the K'^ decay vertex and the primary vertex 
of the event to be positive, and the probability of the 
Df candidate's vertex fit to be larger than 0.001. The 
selection criteria are optimized to maximize the statisti- 
cal significance of the signal. No requirement on the mass 
of the Df candidates is applied, since this distribution is 
used to extract the signal yield. 

A lepton and a kaon, both with negative charge, are 
combined with the Df candidate to form a B~ candi- 
date. Leptons are required to have momentum \pi\ larger 
than 0.8GeV/c [16] to reject those not directly originat- 
ing from B mesons. The probability of the vertex fit of 
the B candidate is required to be larger than 0.01. 

Three event-shape variables that are sensitive to the 
topological differences between jet-like continuum events 
and more spherical BB events are used as input to a 
neural network to suppress background from continuum 
events. These variables are the normalized second Fox- 
Wolfram moment i?2 [17], the monomial L2 [18], and the 
cosine of the angle between the flight direction of the 
reconstructed B candidate and the rest of the event. A 
neural network whose input variables are the B candidate 
mass, the B candidate sphericity, and the thrust value of 
the rest of the event, is used to reduce the background 
from other B decays, providing a slight, but not negligible 
improvement in the sensitivity of the measurement. 

After applying these selection criteria, the remaining 
background events are divided into two classes, depend- 
ing on whether or not they contain a correctly recon- 
structed Df meson. The first class is the more impor- 
tant of the two. We refer to it as Df background events 
in the following. Most of these events contain a Dg orig- 
inating from decays such as B — >■ DgD, where the kaon 
and lepton tracks used to form a B candidate are taken 
from the other B meson in the event. The angular corre- 
lation between the flight directions of the Dg and the D 
is used to suppress the Dg background candidates. The 
direction of the D meson is estimated from the direction 
of a previously unused charged or neutral kaon candidate 
that is assumed to be from D — > K^'^^X decays. Rcqiur- 
ing the cosine of the angle between the flight direction 
of the Dg candidate and the additional kaon to be larger 
than —0.5, about 30% of the D^-background events are 
rejected, as shown in Fig. 1. About 8% of the remaining 
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FIG. 1: Angular distribution of the cosine of the angle 
between the Bight direction of the meson and addi- 
tional charged and neutral kaons: (a) cosQ{DfK^) and (b) 
cos Q{DfKs). Solid lines represent signal MC events; dashed 
lines arc Df background. The vertical lines indicate the se- 
lection applied. 



events have multiple candidates, predominantly two. In 
such cases, we choose the candidate with the largest B 
vertex fit probability. 

The remaining events are divided into signal regions 
and sidebands based on the mass of the Df candidate. 
The sidebands arc defined by 1.9GcV/c^ < ■m{Df) < 
1.94GeV/c2 and 2.0GeV/c2 < m{D+) < 2.04GeV/c^. 
Fits to the Df mass distributions are performed sepa- 
rately for each decay channel to define the signal regions, 
and to measure the number of reconstructed mesons, 
which are used later for extracting the signal yield. The 
signal regions are defined as ±2.5(t wide bands, centered 
on the "fitted means" for each decay channel. Signal 
events are identified by the missing mass of the visible 
decay products Y = DfK~£~ with respect to the nom- 
inal B meson mass: 



MI = {Eb-Ey)' 



(1) 



where Eb is the beam energy, corresponding to the en- 
ergy of the B meson, while Ey and pV represent the 
energy and momentum of the Y composite, respectively. 
Due to its smallness and unknown direction, the momen- 
tum of the B meson is neglected. This leads to a distri- 
bution for Afj^j with a Gaiissian shape for correctly re- 
constructed signal events. Other B scmileptonic decays, 
where one particle is not reconstructed or is erroneously 
included, lead to higher or lower values of M^. 

To extract the signal yield, we perform an unbinned 
extended maximum- likelihood fit, applied simultaneously 
to the distributions of the signal region and the side- 
bands of the three decay chains. While the sidebands 
are populated only by combinatorial background events, 
the signal region also contains background and sig- 
nal events. Because their lepton acceptances differ, the 
electron and muon channels are fitted separately. The 
combinatorial background is modeled using a sum of two 
Gaussian distributions whose parameters are the same 
for the three Df decay chains. This parameterization is 
favored by MC simulation. This fit technique is equiva- 
lent to a sideband subtraction. The contributions of Dt 



background events are modeled using a Fermi function: 

I 



(2) 



where Mq represents the Mf-^ drop-ofF value and Ec the 
smearing of the Fermi-edge. The values for Mq and 
Ec, M§ = (0.303 ± 0.034) GeV^/c* and Ec = (0.333 ± 
0.018) GeV^c^ for the electron channel, and Mq = 
(0.247±0.041) GeV7c'' and Ec = (0.346±0.022) GeV7c^ 
for the muon channel, are fixed to the values derived from 
fits to MC distributions and arc the same for all Df de- 
cay chains. Signal events are modeled by a Gaussian 
distribution, with the same mean and width for all re- 
construction channels. The width is fixed to the value 
determined from the simulation. The mean of the distri- 
bution is determined in the fit, allowing for contributions 
from events with a D*~^ in the final state. 

The total number of events with a and the number 
of combinatorial background events in the signal region 
have been determined from fits to the m[D'^) distribu- 
tions. The number of signal B~ Di*^~^ K~i~V£ and 
Df background events are extracted from the fits to the 
M^ distributions, separately for the electron and muon 
samples. For these fits the three Df decay channels are 
combined, taking into account their detection branching 
fractions ebb. = B{D+ D\d2) x B{Dl -J> d'idA) and 
individual reconstruction efficiencies ereco- For illustra- 
tion, these efficiencies and the branching ratios are listed 
in table I, together with the total fitted number of signal 
events and the estimated contributions from each of the 
three channels. 

The fit is performed in the range jM^^J < l.SGcV^C* 
has 10 free parameters: the mean value of M^, the to- 
tal number of fitted signal events AT^'snai^ parameters 
that describe the shape of the combinatorial background, 
and three sideband normalization parameters. The num- 
ber of signal and £>+ background events are free in the fit, 
only the sum of both values is constrained to the result 
of the fits to the m{Df) distributions. 

The likelihood function is 



AT, 



n! 



.(^signal)n-Q]-[p(^2 (3) 



with Nj the number of events and ViM^ ^ aj ) the proba- 
bility density function (PDF) for a given fit slice j (signal 
region or sideband of each _D+ decay chain) with the fit 
parameters a, and n — iVj the total number of events. 

Using MC experiments from a generator, which in- 
cludes parameterizations of detector performance for sig- 
nal reconstruction and background expectations, it has 
been verified that the fit is able to extract signal branch- 
ing fractions for B{B- -)• B^f^"^ R- l-y i) > 3 x 10"^. 
Values of fit biases are also determined with this proce- 
dure and are taken into account in the analysis. 
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TABLE I: Signal yields, selection efficiencies ereco, and branching fractions esR = B{D+ Dld2) X B{D1 d3d4) for tfie 
individual and combined decay chains. The signal yields of each decay chain arc computed using iV"'«5"''' and the efficiencies 
and are given for illustration only. The errors on the signal yields are the fit errors, the uncertainties of Creco are the systematic 
uncertainties aud tlu^ uiu t-rtaiiitics of cbr represent tli(> liiuit(>d kncnvk-d^c of the braucliiiig fractions of tlit^ . 



Df decay chain 


Tysignal 
electron 


^reco, electron [%] 


lysignal 
^ ' muon 


^reco, muon [^] 


CBR [%] 


all 


259.41^^1 










D+ 07r+, <j) K+K- 




(2.76 ± 0.08) 


qo 1 +23.3 
± 22.9 


(1.62 ±0.06) 


(2.18 ± 0.33) 


D+ K*°K+, K*° K--n-+ 


Qc fj+22.2 
oO.^ 22 1 


(1.79 ± 0.06) 


70. 2l^^!;!* 


(1.09 ±0.05) 


(2.60 ± 0.40) 


D! ^ K'iK . iv^! ^ TT - 




(2.1)8 = ().()«) 


17 J 


(1.78 = ().()()) 


(1.02 = I). 09) 



Fit results are given in Table I. Reconstruction efficien- 
cies for the tliree decay chains are obtained by counting 
simulated signal events in the range |M^| < 1.2 GcV^/c"'. 
As reported in Ref. [19] the reconstruction efficiency of 
the Df (/>7r+ decay chain depends on the requirement 
on the 4> mass. The impact of this effect is covered by 
the systematic uncertainties on cbr- Figure 2 shows the 
sideband subtracted distributions summed over the 
decay channc^k. 

The bias-corrected signal yields are -^gif^tron ~ ^Oltgy 
and N^f^f = '206^11- The bias correction is +42 (-4) 
events for the electron (muon) channel. Extended simu- 
lations showed that the source of the bias is a fluctuation 
of the underlying combinatorial background distribution. 



TABLE II: Additive systematic uncertainties in events. 




[GeV^/c*] 



FIG. 2: Sideband subtracted distributions witli fitted 
functions superimposed: (a) for the electron channel and (b) 
for the muon channel. All Df reconstruction chains have been 
summed. Solid lines represent the full distribution, dashed 
lines are the background component and dotted lines rep- 
resent the fitted signal component. 

The systematic uncertainties are divided into two cat- 
egories: additive uncertainties (Table II) are related to 
the number of extracted signal events, while multiplica- 
tive uncertainties (Table III) are related to the calcu- 
lated branching fraction. The uncertainty due to the Df 
daughter branching fractions is quoted separately. 

The systematic uncertainty arising from the choice of 
the Df background PDF is evaluated using 1000 statis- 
tically independent MC experiments. Each experiment 
corresponds to different values for the two parameters 
that describe the PDF, Mq and Ec, which are distributed 
according to the error matrix for these parameters. We 
take the width of a Gaussian fitted to the resulting A^'^'snai 



source 


AiVelec. [Evts] 


AiVmuon [Evts] 


Df bkg parameterization 


19.9 


15.9 


Single channel signal yields 


14.5 


9.0 


Width of the signal PDF 


3.9 


4.3 


Error of the m.{Df) fits 


3.6 


3.4 


Total, affecting significance 


25.2 


19.1 


Bias correction 


2.2 


1.8 


Total uncertainty 


25.3 


19.2 



distribution as a systematic uncertainty. The impact of 
shape differences between data and MC have been stud- 
ied, as well as shape differences due to varying composi- 
tions of the Df background, and both foimd to be negli- 
gible. A similar procedure is used to estimate the uncer- 
tainty due to using the Df branching fractions esR for 
the combination of the individual channel signal yields. 
MC samples of esR are produced for each decay channel 
using the information of Ref. [1] . This leads to differences 
in the total number of extracted signal events. The width 
of a Gaussian fitted to the resulting distribution of signal 
yields is taken as systematic uncertainty. 

The width of the Gaussian PDF of for signal, as 
well as the number of fitted Df, are varied by ±la to 
evaluate these systematic uncertainties. This approach 
also takes into account the variation of the width due to 
a contribution of D*"*" to the signal yield. The systematic 
uncertainty related to the bias correction is given by the 
statistical imcertainty of the correction. 

We evaluate the uncertainty of the signal MC model 
by calculating the difference of the efficiencies between 
the alternative signal models and the Goity-Roberts sig- 
nal MC model. The impact of the finite statistics of the 
simulated signal sample is deduced from the uncertainty 
on the efficiency determination. The uncertainty arising 
from particle identification, as well as from the K'^ re- 
construction, is determined using dedicated high purity 
control samples for the corresponding particles. Uncer- 
tainties arising from track and photon reconstruction, as 
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well as from radiative corrections, are evaluated by vary- 
ing their reconstruction efficiencies and the energy radi- 
ated by photons in the simulation. The uncertainty on 
the number of B mesons in the data set is determined 
as described in Ref. [20] and the -D+ daughter branching 
fraction uncertainties are taken from Ref. [1]. 



TABLE III: Multiplicative systematic uncertainties in per- 
cent. 



sourc:c 


syst. uncor 


electron ( 


rmiou) ch. [%] 




o- 


K"'K 




Signal MC model 


7.6 (0.2) 


3.1 (6.4) 


5.9(2.1) 


N(Signal MC) 


2.7(3.5) 


3.3 (4.2) 


2.5(3.3) 


Particle ID 


0.6(1.6) 


1.2(4.9) 


3.6(7.9) 


K°s eff. 


-(-) 


-(-) 


2.0(3.1) 


Tracking eff. 


0.4 (0.1) 


0.5(0.2) 


1.8(2.4) 


Photon eff. 


0.6 (0.9) 


0.4 (0.9) 


0.5(0.7) 


Radiative corr. 


2.0(2.1) 


2.2(2.5) 


1.9(1.9) 


Total Aereco 


8.4 (4.5) 


5.2(9.5) 


8.1 (9.9) 


B counting 


1.1(1.1) 


1.1(1.1) 


1.1(1.1) 


B{Dt) 


15.1(15.1) 


15.4(15.4) 


6.0 (6.0) 



A second fit, imposing an A^'^'snai „ q hypothesis, is 
used to estimate the significance of the measurement. 
Since the mean of the Gaussian is a free parameter in the 
signal PDF, the difference in the number of free parame- 
ters (ANDF) of the fits is larger than one. As shown in 
Ref. [21], the resulting probability distribution cannot be 
approximated by a chisquare distribution with an integer 
number of degrees of freedom. Thus, only a significance 
range, representing the significances for ANDF = 2 and 
ANDF = 1, is calculated. Including statistical and sys- 
tematic uncertainties, the ranges arc [3.3 — 3.7]ct and 
[3.5 — 3.9]cr for the electron and muon channel, respec- 
tively. Combining both lepton channels results in a sig- 
nificance larger than 5. Oct. 

The branching fractions for the individual lepton chan- 
nels are B{B- oi*'^'^ K' p-Ve) = [5.8ll}-i^o(stat.) ± 
0.54(syst.) ± 0A9{B(D,))] x lO""' and bIb^ 



Di*^+K- 



lj,-Vf,) = [6.68ti:^^(stat.) ± 0.69(syst.) ± 
0.56(B(Ds))] X 10^'', where the last uncertainty reflects 
the limited knowledge of the Dg branching fractions. 
The measurements are combined, taking into account 
the correlations between their systematic uncertainties, 
yielding B{B- D^*^^K-£-Ve) = [6.13ti !^3(stat.) ± 
0.43(syst.) ± 0.51(S(Ds))] x 10"^. 

In summary, using a data sample of about 376.9 mil- 
lion BB pairs, we find evidence for the decay B^ 
Da K i i^i. The signal has a significance larger than 
S.Oct, after taking systematic effects into account. The 
measured branching fraction, B{B~ — > D^*'^'^ K~ i~V() = 
[6.13;j:23(stat.)±0.43(syst.)±0.51(i3(L's))] x 10-^ where 



the last uncertainty reflects the limited knowledge of the 
Ds branching fractions, is consistent with the previous 
upper limit reported by the ARGUS collaboration and 
with theoretical expectations. 
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